OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework

Internal identifier: 000187 (Main/Exploration); previous: 000186; next: 000188

Authors: Rong Huang [Japan]; Palaiahnakote Shivakumara [Japan]; Yaokai Feng [Japan]; Seiichi Uchida [Japan]

RBID : Pascal:13-0328463

Abstract

To handle the variety of scene characters, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an Optical Character Recognition (OCR) module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant detection/recognition and pads the missing detection/recognition. The proposed framework bridges the gap between scene character detection and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the detection and recognition at the character level, which enables dealing with special scenarios such as single characters, text along arbitrary orientations or text along curves. We perform experiments on the benchmark ICDAR 2011 Robust Reading Competition dataset which includes a text localization task and a word recognition task. The quantitative results demonstrate that multiple hypotheses outperform a single hypothesis, and are comparable with state-of-the-art methods in terms of recall, precision, F-measure, character recognition rate, total edit distance and word recognition rate. Moreover, two additional experiments are conducted to confirm the simplicity of parameter setting in this proposal.
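The abstract says that the OCR module returns several weighted candidates per suspected region and that an integration module combines them, but it does not state the combination rule. The following is a minimal sketch of one plausible rule, plain weighted voting, and is an assumption for illustration rather than the authors' method. The function name `integrate`, the candidate labels and the weights are all hypothetical, and the pruning and padding steps mentioned in the abstract are not modeled.

```python
from collections import defaultdict


def integrate(hypotheses):
    """Sum the weights of identical labels and return the best (label, score).

    `hypotheses` is a list of candidate lists, one per image operator;
    each candidate is a (label, weight) pair. This is an assumed voting
    rule, not the integration module described in the paper.
    """
    scores = defaultdict(float)
    for candidates in hypotheses:
        for label, weight in candidates:
            scores[label] += weight
    return max(scores.items(), key=lambda item: item[1])


# Hypothetical OCR candidates for one suspected region,
# one list per image operator / parameter setting.
hypotheses = [
    [("O", 0.6), ("0", 0.4)],
    [("0", 0.5), ("O", 0.3), ("Q", 0.2)],
    [("O", 0.7), ("D", 0.3)],
]

label, score = integrate(hypotheses)
print(label, round(score, 2))
```

In this toy input the label "O" collects the largest total weight across the three hypotheses, so it wins even though one operator ranked "0" first.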


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework</title>
<author>
<name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">13-0328463</idno>
<date when="2013">2013</date>
<idno type="stanalyst">PASCAL 13-0328463 INIST</idno>
<idno type="RBID">Pascal:13-0328463</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000042</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000726</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000030</idno>
<idno type="wicri:doubleKey">0916-8532:2013:Rong Huang:scene:character:detection</idno>
<idno type="wicri:Area/Main/Merge">000190</idno>
<idno type="wicri:Area/Main/Curation">000187</idno>
<idno type="wicri:Area/Main/Exploration">000187</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework</title>
<author>
<name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">IEICE transactions on information and systems</title>
<title level="j" type="abbreviated">IEICE trans. inf. syst.</title>
<idno type="ISSN">0916-8532</idno>
<imprint>
<date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">IEICE transactions on information and systems</title>
<title level="j" type="abbreviated">IEICE trans. inf. syst.</title>
<idno type="ISSN">0916-8532</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Aspect ratio</term>
<term>Character recognition</term>
<term>Heuristic method</term>
<term>Localization</term>
<term>Multiple image</term>
<term>Open market</term>
<term>Operator</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Refinement method</term>
<term>Segmentation</term>
<term>Speech recognition</term>
<term>State of the art</term>
<term>Voting</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance caractère</term>
<term>Opérateur</term>
<term>Reconnaissance optique caractère</term>
<term>Image multiple</term>
<term>Méthode heuristique</term>
<term>Segmentation</term>
<term>Rapport aspect</term>
<term>Méthode raffinement</term>
<term>Marché concurrentiel</term>
<term>Localisation</term>
<term>Reconnaissance parole</term>
<term>Evaluation performance</term>
<term>Etat actuel</term>
<term>Vote</term>
<term>Reconnaissance forme</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Vote</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">To handle the variety of scene characters, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an Optical Character Recognition (OCR) module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant detection/recognition and pads the missing detection/recognition. The proposed framework bridges the gap between scene character detection and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the detection and recognition at the character level, which enables dealing with special scenarios such as single characters, text along arbitrary orientations or text along curves. We perform experiments on the benchmark ICDAR 2011 Robust Reading Competition dataset which includes a text localization task and a word recognition task. The quantitative results demonstrate that multiple hypotheses outperform a single hypothesis, and are comparable with state-of-the-art methods in terms of recall, precision, F-measure, character recognition rate, total edit distance and word recognition rate. Moreover, two additional experiments are conducted to confirm the simplicity of parameter setting in this proposal.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Japon</li>
</country>
<region>
<li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement>
<li>Fukuoka</li>
</settlement>
<orgName>
<li>Université de Kyūshū</li>
</orgName>
</list>
<tree>
<country name="Japon">
<region name="Kyūshū">
<name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
</region>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
</country>
</tree>
</affiliations>
</record>
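
The record above stores its identifiers in `<idno type="...">` elements. The listing uses the prefixes `wicri:` and `inist:` without declaring them, so a namespace-aware XML parser may reject it as it stands. As a hedged workaround, the sketch below pulls the identifiers out with a regular expression instead. The excerpt is shortened from the listing, and the names `RECORD_EXCERPT`, `IDNO_PATTERN` and `extract_idnos` are hypothetical helpers, not part of Dilib.

```python
import re

# Shortened excerpt of the record's <publicationStmt>, copied from the listing.
RECORD_EXCERPT = """
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">13-0328463</idno>
<date when="2013">2013</date>
<idno type="RBID">Pascal:13-0328463</idno>
<idno type="wicri:Area/Main/Exploration">000187</idno>
</publicationStmt>
"""

# Matches <idno type="KIND">VALUE</idno> and captures KIND and VALUE.
IDNO_PATTERN = re.compile(r'<idno type="([^"]+)">([^<]*)</idno>')


def extract_idnos(xml_text: str) -> dict[str, str]:
    """Map each idno type to its text content (the last occurrence wins)."""
    return {kind: value.strip() for kind, value in IDNO_PATTERN.findall(xml_text)}


idnos = extract_idnos(RECORD_EXCERPT)
print(idnos["RBID"])
```

A regular expression is only adequate here because the `<idno>` elements are flat and contain no nested markup; for anything more structured, declaring the missing prefixes and using a real XML parser would be the safer choice.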

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000187 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000187 | SxmlIndent | more

To link to this page from within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:13-0328463
   |texte=   Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024